Metadata Harvesting with R and OAI-PMH
نویسنده
چکیده
The Open Archives Initiative (http://www.openarchives.org/) develops and promotes interoperability standards that aim to facilitate the efficient dissemination of content. One key project is the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH, http: //www.openarchives.org/pmh/) which provides “a low-barrier mechanism for repository interoperability” for archives (institutional repositories) containing digital content (digital libraries). OAI-PMH allows people (service providers, such as the ones registered with the OAI listed on http://www.openarchives.org/service/listproviders.html) to harvest metadata (from data providers, such as the ones registered with and validated by the OAI listed on http: //www.openarchives.org/Register/BrowseSites/). Data Providers administer systems that support the OAI-PMH as a means of exposing metadata. Service Providers use metadata harvested via the OAI-PMH as a basis for building value-added services. OAI-PMH, currently in version 2.0, defines a mechanism for data providers to expose their metadata. The protocol mandates that individual archives map their metadata to the Dublin Core (DC, http://dublincore.org/), a simple and common metadata set for cross-domain information resource description. OAI-PMH is a set of six verbs or services that are invoked within HTTP, returning the request results in XML format. The OAI-PMH specification can be found at http:// www.openarchives.org/OAI/openarchivesprotocol.html. Here, we summarize the basic facts and terminology. A harvester is a client application that issues OAI-PMH requests. A harvester is operated by a service provider as a means of collecting metadata from repositories. Repositories are network accessible servers that can process the six OAI-PMH requests, and are managed by a data provider to expose metadata to harvesters. OAI-PMH distinguishes between three distinct entities related to the metadata made accessible by the OAI-PMH:
منابع مشابه
Resource Harvesting within the OAI-PMH Framework
Motivated by preservation and resource discovery, we examine how digital resources, and not just metadata about resources, can be harvested using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). We review and critique existing techniques for identifying and gathering digital resources using metadata harvested through the OAI-PMH. We introduce an alternative solution that...
متن کاملInterweaving OAI-PMH data sources with the linked data cloud
The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) has found wide-spread adoption for exchanging bibliographic metadata. In parallel, the W3C’s Linked Data Initiative exposes and interlinks structured data from a variety of data sources on the Web. Since many of these data sources contain valuable information for institutional repositories (e.g., shared concept definitions,...
متن کاملmod_oai: An Apache Module for Metadata Harvesting
We describe mod_oai, an Apache 2.0 module that implements the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). OAI-PMH is the de facto standard for metadata exchange in digital libraries and allows repositories to expose their contents in a structured, application-neutral format with semantics optimized for accurate incremental harvesting. mod_oai differs from other OAIPMH i...
متن کاملUsing OAI-PMH and METS for exporting metadata and digital objects between repositories
Purpose – To examine the relationship between deposit of electronic theses in institutional and archival repositories. Specifically the paper considers the automated export of theses for deposit in the archival repository in continuation of the existing arrangement in Wales for paper-based theses. Design/methodology/approach – The paper presents a description of software that makes use of the O...
متن کاملThe OAI2LOD Server: Exposing OAI-PMH Metadata as Linked Data
Many institutions grant access to their metadata repositories via the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). However, this protocol has two significant drawbacks: it does not make its resources accessible via dereferencable URIs, and it provides only restricted means of selective access to metadata. The OAI2LOD Server handles these shortcomings by republishing meta...
متن کامل